Automatic Classification of Musical Artists based on Web-Data

نویسندگان

  • Peter Knees
  • Elias Pampalk
  • Gerhard Widmer
چکیده

The organization of music is one of the central challenges in times of increasing distribution of digital music. A well-tried means is the classification in genres and/or styles. In this paper we propose the use of text categorization techniques to classify artists present on the Internet. In particular, we retrieve and analyze webpages ranked by search engines to describe artists in terms of word occurrences on related pages. To classify artists we primarily use support vector machines. Based on a previously published paper and on a master’s thesis, we present experiments comprising the evaluation of the classification process on a taxonomy of 14 genres with altogether 224 artists, as well as an estimation of the impact of daily fluctuations in the Internet on our approach, exploiting a long-term study over a period of almost one year. On the basis of these experiments we study (a) how many artists are necessary to define the concept of a genre, (b) which search engines perform best, (c) how to formulate search queries best, (d) which overall performance we can expect for classification, and finally (e) how our approach is suited as a similarity measure for artists.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Uncovering Affinity of Artists to Multiple Genres from Social Behaviour Data

In organisation schemes, musical artists are commonly identified with a unique ‘genre’ label attached, even when they have affinity to multiple genres. To uncover this hidden cultural awareness about multi-genre affinity, we present a new model based on the analysis of the way in which a community of users organise artists and genres in playlists. Our work is based on a novel dataset that we ha...

متن کامل

A Deeper Look into Web-based Classification of Music Artists

We extensively evaluate a Web-based approach to automatic classification of musical artists into genres and explore several directions for improvements. The investigated steps to advance the current approach include the finding of optimal query terms to initially retrieve a high fraction of relevant pages from the Web and furthermore the development of a filtering technique to identify and excl...

متن کامل

Cultural Orientation: Classifying Subjective Documents by Cociation Analysis

This paper introduces a simple method for estimating cultural orientation, the affiliation of hypertext documents in a polarized field of discourse. Using a probabilistic model based on cocitation information, two experiments are reported. The first experiment tests the model’s ability to discriminate between leftand right-wing documents about politics. In this context the model is tested on th...

متن کامل

MusicWeb: Music Discovery with Open Linked Semantic Metadata

This paper presents MusicWeb, a novel platform for music discovery by linking music artists within a web-based application. MusicWeb provides a browsing experience using connections that are either extra-musical or tangential to music, such as the artists’ political a liation or social influence, or intra-musical, such as the artists’ main instrument or most favoured musical key. The platform i...

متن کامل

Web Services for Music Information Retrieval

In the emerging world of networked and distributed digital libraries, the Web services framework will be a key to facilitating simple inter-application communication between them. Yet, despite the popularity of Web services in the business sector and their seemingly obvious applicability to the digital library domain, and to MIR in particular, the adoption of these new protocols has not been wi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005